Signal synthesizing



This invention relates to the synthesizing of a first and a second output signal 
from an input signal. 



Within the field of audio coding, parametric audio coders have gained
increasing interest. It has been shown that transmitting (quantized) parameters that describe
audio signals requires only little transmission capacity and that they allow a decoding at the
receiving end which results in an audio signal that perceptually does not significantly differ
from the original signal. Hence, bit-rate savings may be obtained by only transmitting one
audio channel combined with a parameter bit stream that describes the spatial properties of
the stereo signal and, thus, allows a decoder to reproduce the spatial properties of the stereo
signal.

One of the above spatial parameters which is of importance for the coding of a
stereo signal comprising an L channel and an R channel is the interchannel cross-correlation
between the L and R channels. Hence, in many systems one of the signal parameters that are
analysed by an encoder is the interchannel cross-correlation. The determined
cross-correlation is then transmitted together with a mono signal from the encoder to a
corresponding decoder.

At the decoder two output signals are reconstructed which have the desired
cross-correlation. Furthermore, it is desirable that the reconstruction introduces only few
artifacts relative to the original stereo signal.

Various methods of decorrelating signals are known as such. Fig. 1 illustrates
a so-called Lauridsen decorrelator. The Lauridsen decorrelator comprises an all-pass filter
101, e.g. a delay, which generates and possibly attenuates a delayed version of the waveform
of the input signal x. The output H⊗x of the filter 101 is subsequently added (102) to the
input resulting in the left channel L and subtracted (103) from the input resulting in the right
channel R.
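The structure of fig. 1 can be sketched in a few lines. The following is a minimal illustration only, assuming the all-pass filter 101 is a plain integer-sample delay with an optional gain; the function name `lauridsen` and the parameters `delay` and `gain` are hypothetical and do not appear in the original text.

```python
import numpy as np

def lauridsen(x, delay, gain=1.0):
    """Return (L, R) = (x + H⊗x, x - H⊗x) for a delay-only filter H (assumed)."""
    x = np.asarray(x, dtype=float)
    h = np.zeros_like(x)
    if delay < len(x):
        h[delay:] = gain * x[:len(x) - delay]   # delayed, possibly attenuated copy
    return x + h, x - h

# White-noise input: the delayed copy is nearly uncorrelated with the input.
rng = np.random.default_rng(0)
x = rng.standard_normal(48000)
L, R = lauridsen(x, delay=480)
rho = np.corrcoef(L, R)[0, 1]   # close to zero for equal-level outputs
```

For a noise-like input the two outputs have nearly equal level and a cross-correlation close to zero, which is the equal-level case in which this decorrelator is suitable.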

The above prior art decorrelator is very suitable as long as the two output
signals are very similar or even equal in level. However, parametric audio coders also apply
level differences to the output signals, the so-called amplitude panning. The above
decorrelator involves the problem that the perceptual quality of the generated signals
deteriorates if the level differences are large.



The above and other problems are solved by a method of synthesizing a first 
and a second output signal from an input signal, the method comprising: 
filtering the input signal to generate a filtered signal; 

obtaining a correlation parameter indicative of a desired correlation between 
the first and second output signals; 

obtaining a level parameter indicative of a desired level difference between the 
first and second output signals; and 

transforming the input signal and the filtered signal by a matrixing operation 
into the first and second output signals, where the matrixing operation depends on the 
correlation parameter and the level parameter. 

Hence, by performing a matrix operation which depends both on the desired 
correlation and the desired level difference, a significant increase in perceptual quality of the 
output signals of a parametric decoder is achieved. 

In a preferred embodiment, the matrixing operation comprises a common 
rotation by a predetermined angle of the first and second output signals in a space spanned by 
the input signal and the filtered input signal; and where the predetermined angle depends on 
the level parameter. 

Hence, by adding an additional rotation to the mixing operation, the relative
level of the output signals may be controlled without influencing the cross-correlation 
between the output signals. 

In a further preferred embodiment, the predetermined angle is selected to 
maximize a total contribution of the input signal to the first and second output signals. It is
realized that the perceptual quality of the signal may be increased, if the amount of the 
filtered signal present in the output signals is minimized and, thus, the amount of the original 
signal is maximized. 

When the method further comprises scaling each of the first and second output 
signals to said desired level difference between the first and second output signals, it is 
ensured that the relative level of the output signals corresponds to the desired level according 
to a level parameter determined by the encoder. 
In a preferred embodiment, the filtering of the input signal comprises all-pass
filtering the input signal, e.g. a comb-filter. The spectral spacing of a comb-filter is uniformly
distributed over frequency. Hence, to be able to obtain a desired dense spacing of peaks and
valleys at low frequencies, the delay of the Lauridsen decorrelator should be very large. This,
however, has the disadvantage that at high frequencies, echoes can be perceived for transient
input signals.

This problem may be solved when the all-pass filter comprises a
frequency-dependent delay. At high frequencies, a relatively small delay is used, resulting in a coarse
frequency resolution. At low frequencies, a large delay results in a dense spacing of the comb
filter.

The filtering may be performed on the full bandwidth of the signal. 
Alternatively, the filtering may be combined with a band-limiting filter, thereby applying the 
decorrelation to one or more selected frequency bands.

The term matrix operation refers to an operation which transforms an input
multi-channel signal into an output multi-channel signal where the components of the output
multi-channel signal are linear combinations of the components of the input multi-channel
signal.

The present invention can be implemented in different ways including the
method described above and in the following, arrangements for encoding and decoding, and
further product means, each yielding one or more of the benefits and advantages described in
connection with the first-mentioned method, and each having one or more preferred
embodiments corresponding to the preferred embodiments described in connection with the
first-mentioned method and disclosed in the dependent claims.

It is noted that the features of the method described above and in the following
may be implemented in software and carried out in a data processing system or other
processing means caused by the execution of computer-executable instructions. The
instructions may be program code means loaded in a memory, such as a RAM, from a storage
medium or from another computer via a computer network. Alternatively, the described
features may be implemented by hardwired circuitry instead of software or in combination
with software.

The invention further relates to an arrangement for synthesizing a first and a 
second output signal from an input signal, the arrangement comprising: 

filter means for filtering the input signal to generate a filtered signal; 
means for obtaining a correlation parameter indicative of a desired correlation
between the first and second input signals;

means for obtaining a level parameter indicative of a desired level difference
between the first and second input signals; and

means for transforming the input signal and the filtered signal by a matrixing
operation into the first and second output signals, where the matrixing operation depends on
the correlation parameter and the level parameter.

The invention further relates to an apparatus for supplying a decoded audio
signal, the apparatus comprising:

an input unit for receiving an encoded audio signal;

a decoder for decoding the encoded audio signal, the decoder comprising an
arrangement for synthesizing a first and a second audio signal as described above and in the
following; and

an output unit for providing the decoded first and second audio signal.

The invention further relates to a decoded multi-channel signal comprising a
first and a second signal component synthesized from an input signal by transforming the
input signal and a filtered signal by a matrixing operation into the first and second signal
components, where the filtered signal is generated by filtering the input signal, and where the
matrixing operation depends on a correlation parameter indicative of a desired correlation
between the first and second input signals and on a level parameter indicative of a desired
level difference between the first and second input signals.

The invention further relates to a storage medium having stored thereon such a
decoded multi-channel signal.
These and other aspects of the invention will be apparent and elucidated from
the embodiments described in the following with reference to the drawing in which:

fig. 1 shows a prior art Lauridsen decorrelator;
fig. 2 illustrates a decorrelator according to an embodiment of the invention;
figs. 3a-c illustrate the signal generation according to an embodiment of the
invention;
fig. 4 schematically shows a system for spatial audio coding; and
fig. 5 shows a schematic view of a system for communicating multi-channel
audio signals.
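Fig. 2 illustrates a decorrelator according to an embodiment of the invention.
The decorrelator comprises an all-pass filter receiving the input signal x, e.g. from a
parametric audio encoder which generates a mono audio signal x and a set of parameters P
including an interchannel cross-correlation ρ and a parameter indicative of the channel
difference c. Preferably, the all-pass filter comprises a frequency-dependent delay, e.g. one period
of a Schroeder-phase complex (see e.g. M. R. Schroeder, "Synthesis of low-peak-factor
signals and binary sequences with low autocorrelation", IEEE Trans. Inf. Theory, 16:85-89,
1970). The decorrelator further comprises an analysis circuit 202 that receives the spatial
parameters and determines the interchannel cross-correlation ρ and the channel
difference c. The circuit 202 determines a mixing matrix M(α,β) as will be described in
connection with figs. 3a-c. The components of the mixing matrix are fed into a
transformation circuit 203 which further receives the input signal x and the filtered signal
H⊗x. The circuit 203 performs a mixing operation according to

\[ \begin{pmatrix} L \\ R \end{pmatrix} = M(\alpha,\beta) \begin{pmatrix} x \\ H \otimes x \end{pmatrix} \tag{1} \]

resulting in the output signals L and R.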

Figs. 3a-c illustrate the signal generation according to an embodiment of the
invention. In fig. 3a, the input signal x is represented by the horizontal axis, while the filtered
signal H⊗x is represented by the vertical axis. As the two signals are uncorrelated they may
be represented as orthogonal vectors spanning a two-dimensional space.

The output signals L and R are represented as vectors 301 and 302,
respectively. In this representation, the correlation between the signals L and R is given by
the angle α between the vectors 301 and 302 according to ρ = cos(α), i.e. by the angular
distance between the vectors. Consequently, any pair of vectors that exhibits
the correct angular distance has the specified correlation.

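Hence, a mixing matrix M which transforms the signals x and H⊗x into
signals L and R with a predetermined correlation ρ may be expressed as follows:

\[ M = \begin{pmatrix} \cos(\alpha/2) & \sin(\alpha/2) \\ \cos(-\alpha/2) & \sin(-\alpha/2) \end{pmatrix} \tag{2} \]

Thus, the amount of all-pass filtered signal depends on the desired correlation.
Furthermore, the energy of the all-pass signal component is the same in both output channels
(but with a 180° phase shift).

It is noted that the case where the matrix M is given by

\[ M = \frac{1}{\sqrt{2}} \begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix} \tag{3} \]

i.e. α = 90°, corresponding to uncorrelated output signals (ρ = 0), corresponds to the
Lauridsen decorrelator of fig. 1.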
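The relation between the mixing matrix and the resulting correlation can be checked numerically. The sketch below assumes that x and H⊗x are uncorrelated and of equal power (as for an ideal, non-attenuating all-pass filter), so that the correlation of L and R equals the cosine of the angle between the two matrix rows; the helper names `mixing_matrix` and `correlation` are illustrative only.

```python
import numpy as np

def mixing_matrix(alpha):
    """Matrix with rows at +alpha/2 and -alpha/2 in the (x, H⊗x) plane."""
    return np.array([[np.cos(alpha / 2),  np.sin(alpha / 2)],
                     [np.cos(-alpha / 2), np.sin(-alpha / 2)]])

def correlation(M):
    """Correlation of L and R, assuming x and H⊗x are uncorrelated, equal power."""
    l, r = M
    return float(l @ r / (np.linalg.norm(l) * np.linalg.norm(r)))

rho = 0.3
M = mixing_matrix(np.arccos(rho))    # alpha chosen such that rho = cos(alpha)
M_uncorrelated = mixing_matrix(np.pi / 2)
```

With α = 90° the matrix reduces to the sum/difference structure scaled by 1/√2, i.e. the case of uncorrelated outputs.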

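In order to illustrate a problem with the matrix of eqn. (3), we assume a
situation with an extreme amplitude panning towards the left channel, i.e. a case where a
signal is present in the left channel only, and a desired correlation of zero between the
outputs. In this case, the transformation of eqn. (1) with the mixing matrix of eqn. (3) yields
L = 1/√2 (x + H⊗x). Thus, the output consists of the original signal x combined with its
all-pass filtered version H⊗x. However, this is an undesired situation, since the all-pass filter
usually deteriorates the perceptual quality of the signal. In this extreme case, the preferred
solution would be a left output consisting of the input signal only; the correlation of the two
outputs would still be zero. In situations with more moderate level differences, the preferred
situation is that the louder output channel contains relatively more of the original signal, and
the softer output channel contains relatively more of the filtered signal. Hence, in general, it
is preferred to maximize the amount of the original signal present in the two outputs together,
and to minimize the amount of the filtered signal.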

According to the invention, this is achieved by introducing a different mixing
matrix including an additional common rotation:

\[ M = C \cdot \begin{pmatrix} \cos(\beta + \alpha/2) & \sin(\beta + \alpha/2) \\ \cos(\beta - \alpha/2) & \sin(\beta - \alpha/2) \end{pmatrix} \tag{4} \]

Here β is an additional rotation, and C is a scaling matrix which ensures that
the relative level difference between the output signals equals c, i.e.

\[ C = \begin{pmatrix} \dfrac{c}{1+c} & 0 \\ 0 & \dfrac{1}{1+c} \end{pmatrix} \]

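Inserting the matrix of eqn. (4) in eqn. (1) yields the output signals generated
by the matrixing operation according to the invention:

\[ \begin{pmatrix} L \\ R \end{pmatrix} = \begin{pmatrix} \dfrac{c}{1+c} & 0 \\ 0 & \dfrac{1}{1+c} \end{pmatrix} \begin{pmatrix} \cos(\beta + \alpha/2) & \sin(\beta + \alpha/2) \\ \cos(\beta - \alpha/2) & \sin(\beta - \alpha/2) \end{pmatrix} \begin{pmatrix} x \\ H \otimes x \end{pmatrix} \]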



This situation is illustrated in fig. 3b. The output signals L and R still have an
angular difference α, i.e. the correlation between the L and R signals is not affected by the
scaling of the signals L and R according to the desired level difference and the additional
rotation by the angle β of both the L and the R signal.
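The statement that scaling and common rotation leave the correlation untouched can be verified with a short numerical sketch. It assumes the scaling matrix C = diag(c/(1+c), 1/(1+c)) and, as before, that x and H⊗x are uncorrelated with equal power; the function names are hypothetical.

```python
import numpy as np

def mixing_matrix(alpha, beta, c):
    """M = C · R, with rows of R at angles beta ± alpha/2 and C = diag(c, 1) / (1 + c)."""
    C = np.diag([c / (1 + c), 1 / (1 + c)])
    rot = np.array([[np.cos(beta + alpha / 2), np.sin(beta + alpha / 2)],
                    [np.cos(beta - alpha / 2), np.sin(beta - alpha / 2)]])
    return C @ rot

def correlation_and_level_ratio(M):
    """Correlation of L and R and the ratio of their levels (L over R)."""
    l, r = M
    nl, nr = np.linalg.norm(l), np.linalg.norm(r)
    return float(l @ r / (nl * nr)), float(nl / nr)

# Any rotation beta gives the same correlation cos(alpha) and level ratio c.
M = mixing_matrix(alpha=np.arccos(0.5), beta=0.2, c=3.0)
rho, ratio = correlation_and_level_ratio(M)
```

The rotation β is therefore a free parameter that can be chosen on perceptual grounds without disturbing either the correlation or the level difference.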

As mentioned above, preferably, the amount of the original signal x in the
summed output of L and R should be maximized. This condition may be used to determine
the angle β, by requiring that the contribution of x to L + R be stationary with respect to β,
according to

\[ \frac{\partial (L + R)}{\partial \beta} = 0 \]

which yields the condition:

\[ \tan(\beta) = \frac{1-c}{1+c} \tan(\alpha/2). \]

This situation is illustrated in fig. 3c, where the sum of the L and R
components is aligned with the direction of x.
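The condition on β can be cross-checked against a brute-force search. The sketch below assumes the scaling C = diag(c/(1+c), 1/(1+c)) and compares the closed-form angle with the grid point that maximizes the coefficient of x in L + R; `optimal_beta` and `sum_components` are illustrative names.

```python
import numpy as np

def optimal_beta(alpha, c):
    """Closed form: tan(beta) = (1 - c) / (1 + c) * tan(alpha / 2)."""
    return np.arctan((1 - c) / (1 + c) * np.tan(alpha / 2))

def sum_components(alpha, beta, c):
    """Coefficients of x and of H⊗x in L + R, with C = diag(c, 1) / (1 + c)."""
    wx = (c * np.cos(beta + alpha / 2) + np.cos(beta - alpha / 2)) / (1 + c)
    wh = (c * np.sin(beta + alpha / 2) + np.sin(beta - alpha / 2)) / (1 + c)
    return wx, wh

alpha, c = np.arccos(0.2), 4.0
beta = optimal_beta(alpha, c)

# Brute-force maximization of the x-contribution over a fine grid of angles.
grid = np.linspace(-np.pi / 2, np.pi / 2, 20001)
best = grid[np.argmax(sum_components(alpha, grid, c)[0])]
```

At the optimum the coefficient of H⊗x in L + R vanishes, which is the alignment of the sum with the direction of x shown in fig. 3c.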

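Fig. 4 schematically shows a system for spatial audio coding. The system
comprises an encoder and a corresponding decoder 405. The encoder describes the
spatial attributes of a multi-channel audio signal by specifying an interaural level difference,
an interaural time (or phase) difference, and a maximum correlation as a function of time and
frequency. Initially, the left and right input signals are split up
into several time/frequency slots, e.g. by time-windowing followed by a transform
operation. In one embodiment, the left and right incoming signals are split up in various
time frames (e.g. 2048 samples at 44.1 kHz sampling rate) and windowed with a square-root
Hanning window. Subsequently, FFTs are computed and the FFT bins are combined into
subbands, where the number of bins combined in a subband depends on the frequency: at higher
frequencies more bins are combined than at lower frequencies. For example, FFT bins
corresponding to approximately 1.8 ERBs (Equivalent Rectangular Bandwidth) may be
grouped into one subband.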
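The time/frequency slotting described above can be sketched as follows. This is an illustration under stated assumptions only: the window is taken to be a square-root Hann window, and the ERB-rate mapping uses the widely used Glasberg and Moore approximation, neither of which is specified by the text; the names `erb_rate` and `analyze_frame` are hypothetical.

```python
import numpy as np

def erb_rate(f_hz):
    """ERB-rate scale (Glasberg & Moore approximation); an assumed choice."""
    return 21.4 * np.log10(1.0 + 0.00437 * f_hz)

def analyze_frame(frame, fs=44100, erbs_per_band=1.8):
    """Window one frame, take the positive-frequency FFT, assign bins to subbands."""
    n = len(frame)
    window = np.sqrt(np.hanning(n))           # square-root Hann window (assumed)
    spectrum = np.fft.rfft(frame * window)    # negative frequencies are not kept
    freqs = np.fft.rfftfreq(n, d=1.0 / fs)
    band_index = np.floor(erb_rate(freqs) / erbs_per_band).astype(int)
    return spectrum, band_index

rng = np.random.default_rng(0)
spectrum, band_index = analyze_frame(rng.standard_normal(2048))
bins_per_band = np.bincount(band_index)       # number of FFT bins in each subband
```

Because the ERB-rate scale is roughly logarithmic in frequency, the subbands at high frequencies collect many more FFT bins than those at low frequencies.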

Subsequently, in the analysis circuit 404, for every time/frequency slot, the
following properties of the incoming signals are analyzed:

The interaural level difference, or ILD, defined by the relative levels of the
corresponding band-limited signals stemming from the two inputs,

The interaural time (or phase) difference (ITD or IPD), defined by the
interaural delay (or phase shift) corresponding to the peak in the interaural cross-correlation
function, and

The (dis)similarity of the waveforms that cannot be accounted for by ITDs or
ILDs, which can be parameterized by the maximum value of the cross-correlation function
(i.e. the value of the cross-correlation function at the position of the maximum peak).

The three parameters described above vary over time; however, since it is
known that the binaural auditory system is very sluggish in its processing, the update rate of
these properties is rather low (typically tens of milliseconds).

The analysis circuit 404 further generates a sum (or dominant) signal S
comprising a combination of the left and right signals. Hence, the L and R signals are
encoded as the sum signal S and
the parameters P comprising the ILD, the ITD/IPD, and the maximum value of the
cross-correlation function.
It is noted that parameter ILD in this embodiment is related to the channel
difference c, i.e. ILD is proportional to the logarithm of c.

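In one embodiment, for each subband, the corresponding ILD, ITD and
correlation ρ are computed. The ITD is determined from the position of the peak of the
cross-correlation function of the subband, and the value of
the cross-correlation function at this peak is used as interaural correlation of this subband.
The ILD is simply computed by taking the power ratio of the left and right channels.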
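The per-subband parameter estimation can be sketched in the time domain as follows. This is a minimal illustration, assuming the inputs are already band-limited signals for one slot, that the ILD is expressed in dB, and that the peak is searched within ±64 samples; the function name `spatial_parameters`, the parameter `max_lag` and the sign convention for the ITD are assumptions, not taken from the text.

```python
import numpy as np

def spatial_parameters(left, right, max_lag=64):
    """Estimate (ILD in dB, ITD in samples, peak correlation) for one slot."""
    left = np.asarray(left, dtype=float)
    right = np.asarray(right, dtype=float)
    p_left, p_right = np.sum(left**2), np.sum(right**2)
    ild_db = 10.0 * np.log10(p_left / p_right)        # power ratio of the channels
    xcorr = np.correlate(left, right, mode="full")    # lags -(N-1) .. N-1
    lags = np.arange(-(len(right) - 1), len(left))
    keep = np.abs(lags) <= max_lag                    # restrict the peak search
    xcorr = xcorr[keep] / np.sqrt(p_left * p_right)   # normalized cross-correlation
    lags = lags[keep]
    peak = np.argmax(xcorr)
    # Positive ITD means the left signal lags the right signal (assumed convention).
    return ild_db, int(lags[peak]), float(xcorr[peak])
```

For example, a left signal that is a copy of the right signal delayed by 5 samples and doubled in amplitude yields an ITD of 5 samples, an ILD of about 6 dB and a peak correlation close to one.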

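The sum signal S may be generated by summing the left and right subbands
after a phase correction (temporal alignment). This phase correction follows from the
computed ITD for that subband and is performed in the frequency domain
by appropriate modification of the phase angles of each FFT bin. If necessary, the sum
signal can be converted to the time domain by (1) inserting
complex conjugates at negative frequencies, (2) inverse FFT, (3) windowing, and (4)
overlap-add. The parameters may be quantized in order to reduce the bit rate required
for their transmission.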

The sum signal S and the parameters P are communicated to a decoder 405.
The decoder generates left and right signals from the sum signal S as described above and
comprises delay circuits 407 and 408 which delay each subband of the left signal by -ITD/2
and each subband of the right signal by ITD/2. Finally, the decoder scales the
subbands according to the ILD for that subband and converts the output signals to the time
domain, e.g. by performing the following steps: (1) inserting complex conjugates at negative
frequencies, (2) inverse FFT, (3) windowing, and (4) overlap-add.
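The four conversion steps listed above can be sketched as an overlap-add synthesis. The example assumes a square-root Hann synthesis window and 50 % frame overlap, neither of which is fixed by the text; `synthesize` is a hypothetical name. NumPy's `irfft` implicitly supplies the conjugate-symmetric negative frequencies, so steps (1) and (2) are a single call.

```python
import numpy as np

def synthesize(frame_spectra, frame_len, hop):
    """Overlap-add synthesis from the positive-frequency spectra of successive frames."""
    window = np.sqrt(np.hanning(frame_len + 1)[:-1])   # periodic sqrt-Hann (assumed)
    out = np.zeros(hop * (len(frame_spectra) - 1) + frame_len)
    for i, spec in enumerate(frame_spectra):
        # Steps (1) and (2): conjugate-symmetric extension and inverse FFT.
        frame = np.fft.irfft(spec, n=frame_len)
        # Steps (3) and (4): windowing and overlap-add.
        out[i * hop:i * hop + frame_len] += window * frame
    return out

# Round trip: analysis with the same window, then synthesis.
N, hop = 256, 128
w = np.sqrt(np.hanning(N + 1)[:-1])
x = np.random.default_rng(1).standard_normal(hop * 9 + N)
spectra = [np.fft.rfft(w * x[i * hop:i * hop + N]) for i in range(10)]
y = synthesize(spectra, N, hop)
```

With matching analysis and synthesis windows the squared window sums to one across overlapping frames, so the interior of the signal is reconstructed exactly; only the first and last half-frames, which are covered by a single window, differ.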
Fig. 5 shows a schematic view of a system for communicating stereo audio
signals. The system comprises a coding device 501 and a decoding device 505, each of which
may be any electronic equipment or part of such equipment.

Here the term electronic equipment comprises computers, such as stationary
and portable PCs, stationary and portable radio communication equipment, and the like. The
coding device and the
decoding device may be combined in one electronic equipment where audio signals are
stored on a computer-readable medium for later reproduction.

The coding device 501 comprises an input unit 511 for receiving a stereo
signal, an encoder 502 for encoding a stereo audio signal including a left signal component L
and a right signal component R. The stereo signal may originate from a set of
microphones, e.g. via further electronic equipment, such as a mixing equipment, etc. The
signals may further be received as an output from another audio player, or by any other
suitable means. The encoder 502 is connected to a transmitter
503 for transmitting the coded signal T via a communications channel 509 to the decoding
device 505. The transmitter 503 may comprise circuitry suitable for enabling the
communication of data. Examples of suitable transmitters include an
Integrated Services Digital Network (ISDN) adapter and a Digital Subscriber Line (DSL)
adapter. The communications channel 509 may be any suitable data link, e.g. a
short-range communications link, such as a
radio-based link.
Further examples of the communications channel include computer networks
and wireless telecommunications networks, such as a Cellular Digital Packet Data (CDPD)
network, a Global System for Mobile (GSM) network, a Code Division Multiple Access
(CDMA) network, a Time Division Multiple Access (TDMA) network, a General Packet
Radio service (GPRS) network, a Third Generation network, such as a UMTS network, or the
like.

Alternatively or additionally, the coding device may comprise one or more
other interfaces 504 for communicating the coded stereo signal T to the decoding device 505.
Examples of such interfaces include a disc drive for storing data on a computer-readable
medium 510, e.g. a floppy-disk drive, a read/write CD-ROM drive, a DVD-drive, etc. Other
examples include a memory card slot, a magnetic card reader/writer, an interface for
accessing a smart card, etc.

Correspondingly, the decoding device 505 comprises a corresponding receiver
508 for receiving the signal transmitted by the transmitter and/or another interface 506 for
receiving the coded stereo signal communicated via the interface 504 and the
computer-readable medium 510. The decoding device further comprises a decoder 507 which receives
the received signal T and decodes it into corresponding components L' and R' of a decoded
stereo signal. A preferred embodiment of such a decoder according to the invention was
described in connection with fig. 4 above. The decoding device further comprises an output
unit 512 for outputting the decoded signals which may subsequently be fed into an audio
player for reproduction via a set of loudspeakers, or the like.

It is noted that the above arrangements may be implemented as general- or
special-purpose programmable microprocessors, Digital Signal Processors (DSP),
Application Specific Integrated Circuits (ASIC), Programmable Logic Arrays (PLA), Field
Programmable Gate Arrays (FPGA), special purpose electronic circuits, etc., or a
combination thereof.

It should be noted that the above-mentioned embodiments illustrate rather than 
limit the invention, and that those skilled in the art will be able to design many alternative 
embodiments without departing from the scope of the appended claims. 

For example, the invention is not limited to stereophonic signals, but may also 
be applied to other multi-channel input signals having two or more input channels. Examples 
of such multi-channel signals include signals received from a Digital Versatile Disc (DVD) 
or a Super Audio Compact Disc, etc. 
In the claims, any reference signs placed between parentheses shall not be
construed as limiting the claim. The word "comprising" does not exclude the presence of
elements or steps other than those listed in a claim. The word "a" or "an" preceding an
element does not exclude the presence of a plurality of such elements.

The invention can be implemented by means of hardware comprising several
distinct elements, and by means of a suitably programmed computer. In the device claim
enumerating several means, several of these means can be embodied by one and the same
item of hardware. The mere fact that certain measures are recited in mutually different
dependent claims does not indicate that a combination of these measures cannot be used to
advantage.



